Goto

Collaborating Authors

 Newport News


marge__neurips_final_ (2)

Michael Lewis

Neural Information Processing Systems

MARGE performs comparably to XLM-R, but with significant variation across languages. We only show results for languages in all model's Table 8: Number of documents per language used for pre-training. Katherine G. Johnson (née Coleman; August 26, 1918 - February 24, 2020) was an She contributed to the science of the U.S. Air Force and space programs,



Standardization of Psychiatric Diagnoses -- Role of Fine-tuned LLM Consortium and OpenAI-gpt-oss Reasoning LLM Enabled Decision Support System

Bandara, Eranga, Gore, Ross, Yarlagadda, Atmaram, Clayton, Anita H., Samuel, Preston, Rhea, Christopher K., Shetty, Sachin

arXiv.org Artificial Intelligence

The diagnosis of most mental disorders, including psychiatric evaluations, primarily depends on dialogues between psychiatrists and patients. This subjective process can lead to variability in diagnoses across clinicians and patients, resulting in inconsistencies and challenges in achieving reliable outcomes. To address these issues and standardize psychiatric diagnoses, we propose a Fine-Tuned Large Language Model (LLM) Consortium and OpenAI-gpt-oss Reasoning LLM-enabled Decision Support System for the clinical diagnosis of mental disorders. Our approach leverages fine-tuned LLMs trained on conversational datasets involving psychiatrist-patient interactions focused on mental health conditions (e.g., depression). The diagnostic predictions from individual models are aggregated through a consensus-based decision-making process, refined by the OpenAI-gpt-oss reasoning LLM. We propose a novel method for deploying LLM agents that orchestrate communication between the LLM consortium and the reasoning LLM, ensuring transparency, reliability, and responsible AI across the entire diagnostic workflow. Experimental results demonstrate the transformative potential of combining fine-tuned LLMs with a reasoning model to create a robust and highly accurate diagnostic system for mental health assessment. A prototype of the proposed platform, integrating three fine-tuned LLMs with the OpenAI-gpt-oss reasoning LLM, was developed in collaboration with the U.S. Army Medical Research Team in Norfolk, Virginia, USA. To the best of our knowledge, this work represents the first application of a fine-tuned LLM consortium integrated with a reasoning LLM for clinical mental health diagnosis paving the way for next-generation AI-powered eHealth systems aimed at standardizing psychiatric diagnoses.




No Soundness in the Real World: On the Challenges of the Verification of Deployed Neural Networks

Szász, Attila, Bánhelyi, Balázs, Jelasity, Márk

arXiv.org Artificial Intelligence

The ultimate goal of verification is to guarantee the safety of deployed neural networks. Here, we claim that all the state-of-the-art verifiers we are aware of fail to reach this goal. Our key insight is that theoretical soundness (bounding the full-precision output while computing with floating point) does not imply practical soundness (bounding the floating point output in a potentially stochastic environment). We prove this observation for the approaches that are currently used to achieve provable theoretical soundness, such as interval analysis and its variants. We also argue that achieving practical soundness is significantly harder computationally. We support our claims empirically as well by evaluating several well-known verification methods. To mislead the verifiers, we create adversarial networks that detect and exploit features of the deployment environment, such as the order and precision of floating point operations. We demonstrate that all the tested verifiers are vulnerable to our new deployment-specific attacks, which proves that they are not practically sound.


Geometric GNNs for Charged Particle Tracking at GlueX

Mohammed, Ahmed Hossam, Rajput, Kishansingh, Taylor, Simon, Furletov, Denis, Furletov, Sergey, Schram, Malachi

arXiv.org Artificial Intelligence

Nuclear physics experiments are aimed at uncovering the fundamental building blocks of matter. The experiments involve high-energy collisions that produce complex events with many particle trajectories. Tracking charged particles resulting from collisions in the presence of a strong magnetic field is critical to enable the reconstruction of particle trajectories and precise determination of interactions. It is traditionally achieved through combinatorial approaches that scale worse than linearly as the number of hits grows. Since particle hit data naturally form a 3-dimensional point cloud and can be structured as graphs, Graph Neural Networks (GNNs) emerge as an intuitive and effective choice for this task. In this study, we evaluate the GNN model for track finding on the data from the GlueX experiment at Jefferson Lab. We use simulation data to train the model and test on both simulation and real GlueX measurements. We demonstrate that GNN-based track finding outperforms the currently used traditional method at GlueX in terms of segment-based efficiency at a fixed purity while providing faster inferences. We show that the GNN model can achieve significant speedup by processing multiple events in batches, which exploits the parallel computation capability of Graphical Processing Units (GPUs). Finally, we compare the GNN implementation on GPU and FPGA and describe the trade-off.


Proof-of-TBI -- Fine-Tuned Vision Language Model Consortium and OpenAI-o3 Reasoning LLM-Based Medical Diagnosis Support System for Mild Traumatic Brain Injury (TBI) Prediction

Gore, Ross, Bandara, Eranga, Shetty, Sachin, Musto, Alberto E., Rana, Pratip, Valencia-Romero, Ambrosio, Rhea, Christopher, Tayebi, Lobat, Richter, Heather, Yarlagadda, Atmaram, Edmonds, Donna, Wallace, Steven, Broshek, Donna

arXiv.org Artificial Intelligence

Mild Traumatic Brain Injury (TBI) detection presents significant challenges due to the subtle and often ambiguous presentation of symptoms in medical imaging, making accurate diagnosis a complex task. To address these challenges, we propose Proof-of-TBI, a medical diagnosis support system that integrates multiple fine-tuned vision-language models with the OpenAI-o3 reasoning large language model (LLM). Our approach fine-tunes multiple vision-language models using a labeled dataset of TBI MRI scans, training them to diagnose TBI symptoms effectively. The predictions from these models are aggregated through a consensus-based decision-making process. The system evaluates the predictions from all fine-tuned vision language models using the OpenAI-o3 reasoning LLM, a model that has demonstrated remarkable reasoning performance, to produce the most accurate final diagnosis. The LLM Agents orchestrates interactions between the vision-language models and the reasoning LLM, managing the final decision-making process with transparency, reliability, and automation. This end-to-end decision-making workflow combines the vision-language model consortium with the OpenAI-o3 reasoning LLM, enabled by custom prompt engineering by the LLM agents. The prototype for the proposed platform was developed in collaboration with the U.S. Army Medical Research team in Newport News, Virginia, incorporating five fine-tuned vision-language models. The results demonstrate the transformative potential of combining fine-tuned vision-language model inputs with the OpenAI-o3 reasoning LLM to create a robust, secure, and highly accurate diagnostic system for mild TBI prediction. To the best of our knowledge, this research represents the first application of fine-tuned vision-language models integrated with a reasoning LLM for TBI prediction tasks.


Sen. Tim Kaine 'very frustrated' by lack of answers on drone incursions at Langley Air Force Base

FOX News

Sen. Tim Kaine, D-Va., tells Fox News Digital he's frustrated by U.S. officials not being forthcoming about the drone incursions over Langley Air Force Base. Nearly one year after mysterious drones hovered near a top-secret military base in Virginia for 17 days, Sen. Tim Kaine says he is "very frustrated" with "so many unanswered questions" that remain. The Virginia Democrat said his state delegation will get a classified briefing on the situation Thursday. For more than two weeks in December 2023, the mystery drones flew into restricted airspace over the installation, home to key national security sites and the F-22 Raptor stealth fighters. The Pentagon has said little about the incidents other than to confirm they occurred after a Wall Street Journal report in October.


Data-Driven Gradient Optimization for Field Emission Management in a Superconducting Radio-Frequency Linac

Goldenberg, Steven, Ahammed, Kawser, Carpenter, Adam, Li, Jiang, Suleiman, Riad, Tennant, Chris

arXiv.org Artificial Intelligence

However, since the energy upgrade, CEBAF has suffered from significant FE induced radiation. With RF on, dose Jefferson Lab's Continuous Electron Beam Accelerator rates observed at 30 cm from the beamline are as high Facility (CEBAF) [1] relies on two superconducting as 10 rem/h and 100 rem/h for neutron and gamma radiation, radio-frequency linear accelerators (SRF linacs) to deliver respectively. This level of radiation causes significant high-energy electron beams to nuclear physics experiments damage to beamline components, including vacuum in the four experimental halls [2]. An integral valves, magnets, and cables of beam position monitors part of these linacs are cryomodules which contain and ion pumps. Replacing these components can use multiple SRF cavities. These SRF cavities provide the significant resources. Worse, portions of both linacs are main accelerating gradients to the electron beam, and considered "Radiation Areas" for days or even weeks into currently produce the 12 GeV beam necessary for scientific scheduled downtime, limiting maintenance activities to discovery.